Attentive Pooling Networks

نویسندگان

  • Cícero Nogueira dos Santos
  • Ming Tan
  • Bing Xiang
  • Bowen Zhou
چکیده

In this work, we propose Attentive Pooling (AP), a two-way attention mechanism for discriminative model training. In the context of pair-wise ranking or classification with neural networks, AP enables the pooling layer to be aware of the current input pair, in a way that information from the two input items can directly influence the computation of each other’s representations. Along with such representations of the paired inputs, AP jointly learns a similarity measure over projected segments (e.g. trigrams) of the pair, and subsequently, derives the corresponding attention vector for each input to guide the pooling. Our two-way attention mechanism is a general framework independent of the underlying representation learning, and it has been applied to both convolutional neural networks (CNNs) and recurrent neural networks (RNNs) in our studies. The empirical results, from three very different benchmark tasks of question answering/answer selection, demonstrate that our proposed models outperform a variety of strong baselines and achieve state-of-the-art performance in all the benchmarks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attentive Convolution

In NLP, convolution neural networks (CNNs) have benefited less than recurrent neural networks (RNNs) from attention mechanisms. We hypothesize that this is because attention in CNNs has been mainly implemented as attentive pooling (i.e., it is applied to pooling) rather than as attentive convolution (i.e., it is integrated into convolution). Convolution is the differentiator of CNNs in that it ...

متن کامل

Attentive Interactive Neural Networks for Answer Selection in Community Question Answering

Answer selection plays a key role in community question answering (CQA). Previous research on answer selection usually ignores the problems of redundancy and noise prevalent in CQA. In this paper, we propose to treat different text segments differently and design a novel attentive interactive neural network (AI-NN) to focus on those text segments useful to answer selection. The representations ...

متن کامل

Attentive Statistics Pooling for Deep Speaker Embedding

This paper proposes attentive statistics pooling for deep speaker embedding in text-independent speaker verification. In conventional speaker embedding, frame-level features are averaged over all the frames of a single utterance to form an utterance-level feature. Our method utilizes an attention mechanism to give different weights to different frames and generates not only weighted means but a...

متن کامل

Attention and feature integration in the feature inheritance effect

Features of neighboring elements are not processed independently. Often, it is assumed that nearby features are integrated by a (pre-attentive) pooling mechanism. Here, we show that in the feature inheritance effect some features are integrated across space whereas others are not. This result may be partly explained by a very focused spatial attention. Our findings challenge models based on a s...

متن کامل

Are juvenile domestic pigs (Sus scrofa domestica) sensitive to the attentive states of humans?--The impact of impulsivity on choice behaviour.

Previous studies have shown that apes, dogs and horses seem to be able to attribute attentive states to humans. Subjects had to choose between two persons: one who was able to see the animal and one who was not. Using a similar paradigm, we tested a species that does not rely strongly on visual cues, the domestic pig (Sus scrofa domestica). Subjects could choose between two unfamiliar persons, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1602.03609  شماره 

صفحات  -

تاریخ انتشار 2016